Upcast gradually when computing variance #4283


Merged 1 commit into main on Aug 6, 2025

Conversation

ftynse
Member

@ftynse ftynse commented Jul 25, 2025

Going all the way to f64 is undesirable, especially for low-precision tensors in bf16 or f8 variants. Upcast only to the next type, e.g., bf16->f32 or f8->bf16. This is consistent with what PyTorch appears to do internally.
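For context (this demo is not from the PR), a small NumPy sketch of why some upcast is needed during variance accumulation, and why one step up can already suffice: accumulating in the input's own 16-bit type can overflow the running sum, while accumulating one precision level up, in f32, gives the exact answer here.

```python
import numpy as np

# Hypothetical demo (not from the PR): variance of 4096 identical fp16
# values. Accumulating in float16 overflows the running sum
# (4096 * 100 = 409600 > float16 max of 65504), while accumulating
# one precision level up, in float32, is exact for this input.
x = np.full(4096, 100.0, dtype=np.float16)

v16 = np.var(x, dtype=np.float16)  # accumulate in float16: overflows
v32 = np.var(x, dtype=np.float32)  # accumulate in float32: exact

print(v16, v32)
```

This is the trade-off the PR targets: accumulation must be wider than the input, but it need not jump all the way to f64.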


Signed-off-by: Alex Zinenko <git@ozinenko.com>
@ftynse ftynse force-pushed the users/ftynse/dont-upscale-variance branch from b661b9c to 5b0479c Compare July 30, 2025 13:13
@ftynse ftynse changed the title Don't upscast when computing variance Upcast gradually when computing variance Jul 30, 2025
@ftynse ftynse requested a review from qedawkins July 30, 2025 13:22
@ftynse ftynse marked this pull request as ready for review July 30, 2025 13:22
Collaborator

@qedawkins qedawkins left a comment


LGTM as long as CI is happy

@ftynse ftynse merged commit ac4657a into main Aug 6, 2025
3 checks passed
@ftynse ftynse deleted the users/ftynse/dont-upscale-variance branch August 6, 2025 19:51
Comment on lines +9325 to 9334
// Upcasting the input tensor to a double-bitwidth dtype for higher precision
// during the computation of the result.
unsigned bitwidth = inputTensorTy.getDtype().getIntOrFloatBitWidth();
if (bitwidth != 64) {
Type targetTy = rewriter.getF64Type();
if (bitwidth == 8)
targetTy = rewriter.getBF16Type();
else if (bitwidth == 16)
targetTy = rewriter.getF32Type();
self = convertTensorToDtype(rewriter, loc, self, rewriter.getF64Type());
Collaborator

Don't you need to replace rewriter.getF64Type() with the targetTy here?
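The selection logic in the diff picks an accumulation width one step up from the input width, rather than jumping straight to f64. A hypothetical Python rendering of that rule (function name is illustrative, not from the patch):

```python
def variance_accum_bitwidth(bitwidth: int) -> int:
    """Illustrative sketch of the diff's dtype selection: upcast the
    accumulation type one step rather than always going to 64 bits.
    (Name and signature are hypothetical, not from the patch.)"""
    if bitwidth == 64:
        return 64   # already 64-bit: no upcast needed
    if bitwidth == 8:
        return 16   # f8 variants -> bf16
    if bitwidth == 16:
        return 32   # f16 / bf16 -> f32
    return 64       # f32 (and anything else below 64) -> f64

print(variance_accum_bitwidth(8),
      variance_accum_bitwidth(16),
      variance_accum_bitwidth(32))
```

Note that in the quoted diff the computed `targetTy` is then unused: `convertTensorToDtype` is still called with `rewriter.getF64Type()`, which is exactly what the review question points out.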


3 participants